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3 <110> APPLICANT: Puchta, Holger 

4 Biesgen, Christian 

6 <120> TITLE OF INVENTION: Recombination systems and methods for eliminating nucleic 

7 sequences from genome of eukaryotic organisms 
9 <130> FILE REFERENCE: 53262 -20031 . 00/13173 -00010-US 

11 <140> CURRENT APPLICATION NUMBER: US 10/750,891 

12 <141> CURRENT FILING DATE: 2004-01-05 

14 <150> PRIOR APPLICATION NUMBER: PCT/EP02/07281 

15 <151> PRIOR FILING DATE: 2002-07-02 

17 <150> PRIOR APPLICATION NUMBER: DE 101 31 786.7 

18 <151> PRIOR FILING DATE: 2001-07-04 
20 <160> NUMBER OF SEQ ID NOS : 30 

22 <170> SOFTWARE: Patentln version 3.3 

25 <210> SEQ ID NO: 1 

26 <211> LENGTH: 788 

27 <212> TYPE: DNA 

28 <213> ORGANISM: Saccharomyces cerevisiae 

30 <220> FEATURE: 

31 <221> NAME/KEY: CDS 

32 <222> LOCATION: (62).. (766) 

33 <223> OTHER INFORMATION: open reading frame coding for I-Scel 

35 <400> SEQUENCE: 1 

36 ggatccagta ctgtacctag aatacaaaga agaggaagaa gaaacctcta cagaagaagt 60 

37 g atg aaa aac ate aaa aaa aac cag gta atg aac ctg ggt ccg aac tct 109 
3 8 Met Lys Asn lie Lys Lys Asn Gin Val Met Asn Leu Gly Pro Asn Ser 

39 1 5 10 15 



40 


aaa 


ctg 


ctg 


aaa 


gaa 


tac 


aaa 


tec 


cag 


ctg 


ate 


gaa 


ctg 


aac 


ate 


gaa 


157 


41 


Lys 


Leu 


Leu 


Lys 


Glu 


Tyr 


Lys 


Ser 


Gin 


Leu 


He 


Glu 


Leu 


Asn 


He 


Glu 




42 








20 










25 










30 








43 


cag 


ttc 


gaa 


gca 


ggt 


ate 


ggt 


ctg 


ate 


ctg 


ggt 


gat 


get 


tac 


ate 


cgt 


205 


44 


Gin 


Phe 


Glu 


Ala 


Gly 


He 


Gly 


Leu 


He 


Leu 


Gly 


Asp 


Ala 


Tyr 


He 


Arg 




45 






35 










40 










45 










46 


tct 


cgt 


gat 


gaa 


ggt 


aaa 


acc 


tac 


tgt 


atg 


cag 


ttc 


gag 


tgg 


aaa 


aac 


253 


47 


Ser 


Arg 


Asp 


Glu 


Gly 


Lys 


Thr 


Tyr 


Cys 


Met 


Gin 


Phe 


Glu 


Trp 


Lys 


Asn 




48 




50 










55 










60 












49 


aaa 


gca 


tac 


atg 


gac 


cac 


gta 


tgt 


ctg 


ctg 


tac 


gat 


cag 


tgg 


gta 


ctg 


301 


50 


Lys 


Ala 


Tyr 


Met 


Asp 


His 


Val 


Cys 


Leu 


Leu 


Tyr 


Asp 


Gin 


Trp 


Val 


Leu 




51 


65 










70 










75 










80 




52 


tec 


ccg 


ccg 


cac 


aaa 


aaa 


gaa 


cgt 


gtt 


aac 


cac 


ctg 


ggt 


aac 


ctg 


gta 


349 


53 


Ser 


Pro 


Pro 


His 


Lys 


Lys 


Glu 


Arg 


Val 


Asn 


His 


Leu 


Gly 


Asn 


Leu 


Val 




54 










85 










90 










95 






55 


ate 


acc 


tgg 


ggc 


gec 


cag 


act 


ttc 


aaa 


cac 


caa 


get 


ttc 


aac 


aaa 


ctg 


397 


56 


He 


Thr 


Trp 


Gly 


Ala 


Gin 


Thr 


Phe 


Lys 


His 


Gin 


Ala 


Phe 


Asn 


Lys 


Leu 
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445 



493 



541 



589 



637 



685 



57 






100 










105 










110 






58 


get age 


ctg 


ttc 


ate 


gtt 


aac 


aac 


aaa 


aaa 


acc 


ate 


ccg 


aac 


aac 


ctg 


59 


Ala Ser 


Leu 


Phe 


He 


Val 


Asn 


Asn 


Lys 


Lys 


Thr 


He 


Pro 


Asn 


Asn 


Leu 


60 




115 










120 










125 








61 


gtt gaa 


aac 


tac 


ctg 


acc 


ccg 


atg 


tct 


ctg 


gca 


tac 


tgg 


ttc 


atg 


gat 


62 


Val Glu 


Asn 


Tyr 


Leu 


Thr 


Pro 


Met 


Ser 


Leu 


Ala 


Tyr 


Trp 


Phe 


Met 


Asp 


63 


130 










135 










140 










64- 


gat ggt 


ggt 


aaa 


tgg 


gat 


tac 


aac 


aaa 


aac 


tct 


acc 


aac 


aaa 


teg 


ate 


65 Asp Gly Gly 


Lys 


Trp 


Asp 


Tyr 


Asn 


Lys 


Asn 


Ser 


Thr 


Asn 


Lys 


Ser 


He 


66 


145 








150 










155 










160 


67 


gta ctg 


aac 


ace 


cag 


tct 


ttc 


act 


ttc 


gaa 


gaa 


gta 


gaa 


tac 


ctg 


gtt 


68 


Val Leu 


Asn 


Thr 


Gin 


Ser 


Phe 


Thr 


Phe 


Glu 


Glu 


Val 


Glu 


Tyr 


Leu 


Val 


69 








165 










170 










175 




70 


aag ggt 


ctg 


cgt 


aac 


aaa 


ttc 


caa 


ctg 


aac 


tgt 


tac 


eta 


aaa 


ate 


aac 


71 


Lys Gly Leu 


Arg Asn 


Lys 


Phe 


Gin 


Leu 


Asn 


Cys 


Tyr 


Leu 


Lys 


He 


Asn 


72 






180 










185 










190 






73 


aaa aac 


aaa 


ccg 


ate 


ate 


tac 


ate 


gat 


tct 


atg 


tct 


tac 


ctg 


ate 


ttc 


74 


Lys Asn 


Lys 


Pro 


He 


He 


Tyr 


He 


Asp 


Ser 


Met 


Ser 


Tyr 


Leu 


He 


Phe 


75 




195 










200 










205 








76 


tac aac 


ctg 


ate 


aaa 


ccg 


tac 


ctg 


ate 


ccg 


cag 


atg 


atg 


tac 


aaa 


ctg 


77 


Tyr Asn 


Leu 


He 


Lys 


Pro 


Tyr 


Leu 


He 


Pro 


Gin 


Met 


Met 


Tyr 


Lys 


Leu 


78 


210 










215 










220 










79 


ccg aac 


act 


ate 


tec 


tec 


gaa 


act 


ttc 


ctg 


aaa 


taataagtcg agtactggat 


80 


Pro Asn 


Thr 


He 


Ser 


Ser 


Glu 


Thr 


Phe 


Leu 


Lys 












81 


225 








230 










235 













733 



82 cc 788 

84 <210> SEQ ID NO: 2 

85 <211> LENGTH: 235 

86 <212> TYPE: PRT 

87 <213> ORGANISM: Saccharomyces cerevisiae 

89 <400> SEQUENCE: 2 

90 Met Lys Asn He Lys Lys Asn Gin Val Met Asn Leu Gly Pro Asn Ser 

91 1 5 10 15 

92 Lys Leu Leu Lys Glu Tyr Lys Ser Gin Leu He Glu Leu Asn He Glu 

93 20 25 30 

94 Gin Phe Glu Ala Gly He Gly Leu He Leu Gly Asp Ala Tyr He Arg 

95 35 40 45 

96 Ser Arg Asp Glu Gly Lys Thr Tyr Cys Met Gin Phe Glu Trp Lys Asn 

97 50 55 60 

98 Lys Ala Tyr Met Asp His Val Cys Leu Leu Tyr Asp Gin Trp Val Leu 

99 65 70 75 80 

100 Ser Pro Pro His Lys Lys Glu Arg Val Asn His Leu Gly Asn Leu Val 

101 85 90 95 

102 He Thr Trp Gly Ala Gin Thr Phe Lys His Gin Ala Phe Asn Lys Leu 

103 100 105 110 

104 Ala Ser Leu Phe He Val Asn Asn Lys Lys Thr lie Pro Asn Asn Leu 

105 115 120 125 

106 Val Glu Asn Tyr Leu Thr Pro Met Ser Leu Ala Tyr Trp Phe Met Asp 

107 130 135 140 
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108 


Asp Gly Gly Lys 


Trp 


Asp 


Tyr 


Asn 


Lys 


Asn 


Ser 


Thr Asn Lys 


Ser 


He 


109 


145 












150 










155 










160 


110 


Val 


Leu 


Asn 


Thr 


Gin 


Ser 


Phe 


Thr 


Phe 


Glu 


Glu 


Val 


Glu 


Tyr 


Leu 


Val 


111 












165 










170 










175 




112 


Lys Gly Leu Arg Asn 


Lys 


Phe 


Gin 


Leu 


Asn 


Cys 


Tyr 


Leu 


Lys 


He 


Asn 


113 










180 










185 










190 






114 


Lys 


Asn Lys 


Pro 


He 


He 


Tyr 


He 


Asp 


Ser 


Met 


Ser 


Tyr 


Leu 


He 


Phe 


115 








195 










200 










205 








116 


Tyr 


Asn 


Leu 


He 


Lys 


Pro 


Tyr 


Leu 


He 


Pro 


Gin 


Met 


Met 


Tyr 


Lys 


Leu 


117 




210 










215 










220 










118 


Pro 


Asn 


Thr 


He 


Ser 


Ser 


Glu 


Thr 


Phe 


Leu 


Lys 












119 


225 












230 










235 












121 


<210> 


SEQ ID NO: 


: 3 
























122 


<211> 


LENGTH: 746 
























123 


<212> 


TYPE: 


DNA 


























124 


<213> 


ORGANISM: 


Chlamydomonas applanata 














126 


<220> 


FEATURE: 


























127 


<221> 


NAME/KEY: 


CDS 
























128 


<222> 


LOCATION: 


(54) 


. - (737) 




















129 


<223> 


OTHER 


INFORMATION: 


open reading frame 


of I-Chul with nucl< 


130 






signal 


























132 


<220> 


FEATURE: 


























133 


<221> 


NAME/KEY: 


misc_f eature 


















134 


<222> 


LOCATION: 


(54) 


. . (83) 




















135 


<223> 


OTHER 


INFORMATION: 


: coding 


for 


nuclear 


location signal 




137 


<400> 


SEQUENCE : 


3 
























138 


ctcgagtacc tagaatacaa agaagaggaa gaagaaactc 


tatagaagaa gee atg 


139 
































Met 


140 


































1 


141 


ggt 


cca 


aag 


aaa 


aag 


aga 


aag 


gtt 


ate 


atg 


tea 


tta 


aca 


caa 


caa 


caa 


142 


Gly 


Pro 


Lys 


Lys 


Lys 


Arg 


Lys 


Val 


He 


Met 


Ser 


Leu 


Thr 


Gin 


Gin 


Gin 


143 










5 










10 










15 






144 


aaa 


gac 


tta 


att 


ttc 


gga 


tct 


eta 


ctg 


ggt 


gat 


gga 


aat 


tta 


caa 


act 


145 


Lys 


Asp 


Leu 


He 


Phe 


Gly 


Ser 


Leu 


Leu 


Gly 


Asp 


Gly 


Asn 


Leu 


Gin 


Thr 


146 








20 










25 










30 








147 


ggt 


tea 


gta 


ggt 


agg 


act 


tgg 


cgc 


tat 


cga 


gcg 


etc 


cat 


aaa 


agt 


gag 


148 


Gly 


Ser 


Val 


Gly 


Arg 


Thr 


Trp 


Arg 


Tyr 


Arg 


Ala 


Leu 


His 


Lys 


Ser 


Glu 


149 




35 










40 










45 










150 


cat 


cag 


aca 


tac 


tta 


ttt 


cat 


aag 


tat 


gaa 


ate 


tta 


aag 


ccg 


ctt 


tgt 


151 


His 


Gin 


Thr 


Tyr 


Leu 


Phe 


His 


Lys 


Tyr 


Glu 


He 


Leu 


Lys 


Pro 


Leu 


Cys 


152 


50 












55 










60 










65 


153 


ggc 


gaa 


aat 


act 


etc 


cca 


aca 


gaa 


agt 


ata 


gtg 


ttc 


gac 


gaa 


aga 


aca 


154 


Gly 


Glu 


Asn 


Thr 


Leu 


Pro 


Thr 


Glu 


Ser 


He 


Val 


Phe 


Asp 


Glu 


Arg 


Thr 


155 












70 










75 










80 




156 


aac 


aag 


gag 


gtt 


aaa 


cgt 


tgg 


ttt 


ttc 


aac 


aca 


tta 


ace 


aat 


cct 


tec 


157 


Asn 


Lys 


Glu 


Val 


Lys 


Arg 


Trp 


Phe 


Phe 


Asn 


Thr 


Leu 


Thr 


Asn 


Pro 


Ser 


158 










85 










90 










95 






159 


tta 


aaa 


ttc 


ttc 


gca 


gac 


atg 


ttc 


tac 


aca 


tat 


gac 


caa 


aac 


aca 


caa 


160 


Leu 


Lys 


Phe 


Phe 


Ala 


Asp 


Met 


Phe 


Tyr 


Thr 


Tyr 


Asp 


Gin 


Asn 


Thr 


Gin 



location 



56 



104 



152 



200 



248 



296 



344 



392 
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161 






100 










105 










110 










162 


aaa 


tgg 


gtt 


aaa 


gat 


gta 


cct 


gta 


aag 


gtt 


caa 


aca 


ttc 


tta 


act 


cct 


440 


163 


Lys 


Trp Val 


Lys 


Asp 


Val 


Pro 


Val 


Lys 


Val 


Gin 


Thr 


Phe 


Leu 


Thr 


Pro 




164 




115 










120 










125 












165 


caa 


get 


tta 


gca 


tac 


ttt 


tat 


ata 


gac 


gat 


gga 


gcg 


tta 


aaa 


tgg 


ctt 


488 


166 


Gin 


Ala 


Leu 


Ala 


Tyr 


Phe 


Tyr 


He 


Asp 


Asp 


Gly 


Ala 


Leu 


Lys 


Trp 


Leu 




167 


130 










135 










140 










145 




168 


aat 


aag 


tct 


aac 


get 


atg 


caa 


att 


tgt 


act 


gaa 


agt 


ttc 


agt 


caa 


ggg 


536 


169 


Asn 


Lys 


Ser 


Asn 


Ala 


Met 


Gin 


He 


Cys 


Thr 


Glu 


Ser 


Phe 


Ser 


Gin 


Gly 




170 










150 










155 










160 






171 


ggc 


acg 


att 


egg 


ate 


caa 


aaa 


gca 


eta 


aaa 


acg 


etc 


tat 


aat 


att 


gat 


584 


172 


Gly Thr 


He 


Arg 


He 


Gin 


Lys 


Ala 


Leu 


Lys 


Thr 


Leu 


Tyr 


Asn 


He 


Asp 




173 








165 










170 










175 








174 


aca 


acg 


ttg 


aca 


aaa 


aaa 


act 


eta 


caa 


gac 


ggc 


aga 


att 


ggc 


tat 


cgt 


632 


175 


Thr 


Thr 


Leu 


Thr 


Lys 


Lys 


Thr 


Leu 


Gin 


Asp 


Gly 


Arg 


He 


Gly 


Tyr 


Arg 




176 






180 










185 










190 










177 


ata 


get 


att 


cct 


gaa 


gee 


agt 


age 


ggt 


get 


ttt 


cgt 


gaa 


gtc 


att 


aaa 


680 


178 


He 


Ala 


He 


Pro 


Glu 


Ala 


Ser 


Ser 


Gly 


Ala 


Phe 


Arg 


Glu 


Val 


He 


Lys 




179 




195 










200 










205 












180 


cct 


ttt 


eta 


gtt 


gat 


tgt 


atg 


aga 


tac 


aaa 


gtt 


tct 


gat 


ggc 


aat 


aaa 


728 


181 


Pro 


Phe 


Leu 


Val 


Asp 


Cys 


Met 


Arg 


Tyr 


Lys 


Val 


Ser 


Asp 


Gly 


Asn 


Lys 




182 


210 










215 










220 










225 




183 


ggc 


cac 


ctt 


tagctcgag 






















746 


184 


Gly His 


Leu 






























187 


<210> SEQ ID NO 


: 4 


























188 


<211> LENGTH: 228 


























189 


<212> TYPE: 


PRT 




























190 


<213> ORGANISM: 


Chlamydomonas applanata 
















192 


<400> SEQUENCE: 


4 


























193 


Met 


Gly 


Pro 


Lys 


Lys 


Lys 


Arg 


Lys 


Val 


He 


Met 


Ser 


Leu 


Thr 


Gin 


Gin 




194 


1 








5 










10 










15 






195 


Gin 


Lys 


Asp 


Leu 


He 


Phe 


Gly 


Ser 


Leu 


Leu 


Gly 


Asp 


Gly 


Asn 


Leu 


Gin 




196 








20 










25 










30 








197 


Thr 


Gly 


Ser 


Val 


Gly 


Arg 


Thr 


Trp 


Arg 


Tyr 


Arg 


Ala 


Leu 


His 


Lys 


Ser 




198 






35 










40 










45 










199 


Glu 


His 


Gin 


Thr 


Tyr 


Leu 


Phe 


His 


Lys 


Tyr 


Glu 


He 


Leu 


Lys 


Pro 


Leu 




200 




50 










55 










60 












201 


Cys 


Gly 


Glu 


Asn 


Thr 


Leu 


Pro 


Thr 


Glu 


Ser 


He 


Val 


Phe 


Asp 


Glu 


Arg 




202 


65 










70 










75 










80 




203 


Thr 


Asn 


Lys 


Glu 


Val 


Lys 


Arg 


Trp 


Phe 


Phe 


Asn 


Thr 


Leu 


Thr 


Asn 


Pro 




204 










85 










90 










95 






205 


Ser 


Leu 


Lys 


Phe 


Phe 


Ala 


Asp 


Met 


Phe 


Tyr 


Thr 


Tyr 


Asp 


Gin 


Asn 


Thr 




206 








100 










105 










110 








207 


Gin 


Lys 


Trp 


Val 


Lys 


Asp 


Val 


Pro 


Val 


Lys 


Val 


Gin 


Thr 


Phe 


Leu 


Thr 




208 






115 










120 










125 










209 


Pro 


Gin 


Ala 


Leu 


Ala 


Tyr 


Phe 


Tyr 


He 


Asp 


Asp 


Gly 


Ala 


Leu 


Lys 


Trp 




210 




130 










135 










140 












211 


Leu 


Asn 


Lys 


Ser 


Asn 


Ala 


Met 


Gin 


He 


Cys 


Thr 


Glu 


Ser 


Phe 


Ser 


Gin 




212 


145 










150 










155 










160 
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213 Gly Gly Thr lie Arg lie Gin Lys Ala Leu Lys Thr Leu Tyr Asn lie 

214 165 170 175 

215 Asp Thr Thr Leu Thr Lys Lys Thr Leu Gin Asp Gly Arg lie Gly Tyr 

216 180 185 190 

217 Arg He Ala He Pro Glu Ala Ser Ser Gly Ala Phe Arg Glu Val He 

218 195 200 205 

219 Lys Pro Phe Leu Val Asp Cys Met Arg Tyr Lys Val Ser Asp Gly Asn 

220 210 215 220 

221 Lys Gly His Leu 

222 225 

225 <210> SEQ ID NO: 5 

226 <211> LENGTH: 582 

227 <212> TYPE: DNA 

228 <213> ORGANISM: Chlamydomonas reinhardtii 

230 <220> FEATURE: 

231 <221> NAME/KEY: CDS 

232 <222> LOCATION: (55) . . (573) 

233 <223> OTHER INFORMATION: openreading frame coding for I-Crel with nuclear 

234 location signal 
236 <220> FEATURE: 

23 7 <221> NAME/KEY: misc_f eature 

238 <222> LOCATION: (55).. (84) 

239 <223> OTHER INFORMATION: coding for nuclear location signal 
241 <400> SEQUENCE: 5 



242 


ctcgagtacc tagaatacaa agaagaggaa gagaaacctc 


taccagaaga agee 


atg 


57 


243 
































Met 




244 
































1 




245 


ggt 


cca 


aag 


aaa 


aag 


aga 


aag 


gtt 


ate 


atg 


aat 


aca 


aaa 


tat 


aat 


aaa 


105 


246 


Gly 


Pro 


Lys 


Lys 


Lys 


Arg 


Lys 


Val 


He 


Met 


Asn 


Thr 


Lys 


Tyr 


Asn 


Lys 




247 








5 










10 










15 








248 


gag 


ttc 


tta 


etc 


tac 


tta 


gca 


999 


ttt 


gta 


gac 


ggt 


gac 


ggt 


age 


ata 


153 


249 


Glu 


Phe 


Leu 


Leu 


Tyr 


Leu 


Ala 


Gly 


Phe 


Val 


Asp 


Gly 


Asp 


Gly 


Ser 


He 




250 






20 










25 










30 










251 


ate 


get 


caa 


att 


aag 


cct 


aat 


cag 


tct 


tat 


aaa 


ttt 


aag 


cat 


cag 


eta 


201 


252 


He 


Ala 


Gin 


He 


Lys 


Pro 


Asn 


Gin 


Ser 


Tyr 


Lys 


Phe 


Lys 


His 


Gin 


Leu 




253 




35 










40 










45 












254 


tea 


etc 


gcg 


ttc 


caa 


gtc 


acg 


caa 


aag 


aca 


cag 


aga 


cgt 


tgg 


ttt 


tta 


249 


255 


Ser 


Leu 


Ala 


Phe 


Gin 


Val 


Thr 


Gin 


Lys 


Thr 


Gin 


Arg 


Arg 


Trp 


Phe 


Leu 




256 


50 










55 










60 










65 




257 


gac 


aaa 


tta 


gtg 


gat. 


gaa 


att 


ggg 


gtt 


ggt 


tat 


gta 


aga 


gat 


agg 


ggt 


297 


258 


Asp 


Lys 


Leu 


Val 


Asp 


Glu 


He 


Gly 


Val 


Gly 


Tyr 


Val 


Arg 


Asp 


Arg 


Gly 




259 










70 










75 










80 






260 


age 


gtt 


teg 


gat 


tat 


att 


eta 


age 


gaa 


ate 


aag 


cct 


ttg 


cat 


aat 


ttt 


345 


261 


Ser 


Val 


Ser 


Asp 


Tyr 


He 


Leu 


Ser 


Glu 


He 


Lys 


Pro 


Leu 


His 


Asn 


Phe 




262 








85 










90 










95 








263 


tta 


aca 


caa 


eta 


caa 


cct 


ttt 


eta 


aaa 


eta 


aaa 


caa 


aaa 


caa 


gca 


aat 


393 


264 


Leu 


Thr 


Gin 


Leu 


Gin 


Pro 


Phe 


Leu 


Lys 


Leu 


Lys 


Gin 


Lys 


Gin 


Ala 


Asn 




265 






100 










105 










110 










266 


tta 


gtt 


tta 


aaa 


att 


att 


gaa 


caa 


ctt 


ccg 


tea 


gca 


aaa 


gaa 


tec 


ccg 


441 
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